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TEACHING FOR MASTERY: THE ROLE OF CLASSROOM TESTING 

by Rebecca M. Valett- 

Part I. The New Role of Testing in Foreign Language Instruction 



We language teachers must face up to an unpleasant truth: most 

of the students who begin language study in our classes fail to attain 
a level of basic competence in a second language. Language study is 
often considered simply as an unpleasant hurdle standing between the 
student and college entrance or the acquisition of an advanced 
academic degree. In fact many language teachers and potential 
teachers graduating from our colleges have but an uneasy and rather 
weak command of the language they profess to teach. 

When forced to confront our failure, we frequently seek refuge 
behind four "excuses’: 

1. But look at my star pupils X and Y: they have attained 

near-native fluency even though they have not had the oppor- 
tunity to travel abroad." Unfortunately the existence of 
n handful of successful students in no way compensates for 
our failure to teach the remaining 95 per cent. Tnat success- 
ful handful would probably have been able to attain that level 
of fluency without us. And furthermore, nobody doubts that 
language teachers have occasionally transmitted their knowledge 
to a select few, or else they would never have been able to 
perpetuate the priesthood of language teachers. 



y. Many students Just haven't received the proper prepara- 
tion before they enter my class. They don't know how to 
study. They don't even knovi anything about their own language. 
Of course* none of us would deny the benefits of solid academic 
background for each of our students. But the whole concept 
of education is based on the promise that the teacher accepts 
the raw material he is given, that is, the students with 
their strengths and with their weaknesses, ond brings these 
students several stops forward on the path of learning. This 
is the challenge of teaching. 




3. 'Students haven't been able to learn the second language 
because they have been taught by Method X. Were wo all to 
use Method V, then our students would really be able to 
master the language." This excuse is widespread and the name 
of ,'uiy method (traditional, audio-lingual, direct, etc.) nay 
be inserted in slots X and Y depending, on who is making, tne 
statement. Every few years the vogue changes and teachers 
clamber on a new bandwo ton, confident that salvation (that 
is, success in teaching a language) is around the corner. 

Yet the method, or itself, is only one factor in student 
.learning: some students have been successfully taught by 
each of the current methods, but most students have typically 
experienced failure, 

/I . • A lot of my students simply lack language aptitude. " Tills 

argument is probably the most pernicious, and even though the 
special Ian; ua e i ft ' myth and its corrollary the language 
block” have never loon corroborated by research, many students, 
administrators, and teachers keep pretending they exist. It's 
a fine out for all concerned, for Johnny's failure to learn 
is then neither tue teacher's fault, nor the schools fault, 
nor iiis own fault: his failure was inevitable. 
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Carroll (1962), as a result of his research on prognostic 
testing, developed a language learning model in which success 
was the function of five factors: instructional factors-- 

presentation of material (text, teacher, etc.) time allowed 
for learning; student factors--general intelligence (i.e., 
ability to follow instructions), motivation (degree of per- 
severancy), aptitude (time needed for learning). 

It is interesting to note here that aptitude is not a "gift" 
but an individual learning rate: in other words, all students 
(except the mentally deficient) can learn a second language, 
but some will learn more rapidly than others. Bloom ( 196 b) 
has taken Carroll's model and placed the emphasis on success. 

He insists that we must change our instructional system so 
that we lead each student through a sequence of successful 
learning experiences: we must vary the types of presentations 
and the time allowed for learning so as to permit all students 
to attain a degree of mastery. 

Teaching for mastery requires a new kind of testing. 

First, let us recall Glaser’s ( 1963 ) distinction between two 
types of tests: the norm- referenced test and the criterion-referenced 
test . The norm- referenced test" lias been with us for decades: 
examples are the College Entrance Examination Board Achievement 
Tests, the MLA Coop Tests, the Pimsleur Proficiency Tests, the Com- 
mon Concepts Test, the MJA Proficiency Tests for Tdachers and Advanced 
Students, and our own homemade final examinations , The norm-raferenced 
test, as tiie name indica'ces, reports student scores with reference 
to a norm: scores may be converted to letter grades, standard scores 
(e.g. , 200-H00) , stanines or percentile band3. The norm- referenced 
test lets the examiner know whether Johnny has done better than 
Susie, how Johnny stands with respect to other Level One students 
in traditional classes, and how Johnny's class compares with other 
classes in the city, state or nation. The criterion-referenced 
test, on the other hand, reveals how much Johnny knows with reference 
to the criterion or some absolute standard, such as a Hat of vocab- 
ulary items, structures using, the subjunctive, or rate of delivery 
as a function of listening comprehension. Classroom quizzes are 
often criterion-referenced tests. 

In constructin’, a norm-referenced test, the examiner is extremely 
concerned about item difficulty: there must bo the proper mix of 
average item3, very difficult items and easier Items so the text 
will spread the student scores over a broad range. Ideally, if 
the teacher must administer an c .d-of-the-year test in order to 
assign grades of A, R, C, 1) and F, he should prepare a test of a 
difficulty level such that scores spread out over a range of let's 
say Id to 99, with most scores grouping around 90 or 60. In building, 
a criterion-referenced test, however, the examine)' has n totally 
different concern: he carefully establishes tils criterion (for 

example, by analyzing all of the features of a given unit) and 
writes items to test (or sample) the various aspects of that 
criterion. It matters not at all whether the items are difficult 
or easy. The teacher administering such a test Is delighted to 
nave the entire class score 99 - 1 00 per cent correct, for that is 
an indication that the content of the unit has been mastered, 

O 
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What does criterion-referenced testing, or mastery testing, 
offer the classroom teacher? This is a relatively new field and 
the pioneer work is being done in California. Two experiments are 
of relevance here. Newmark and Sweigert (1966) report on a re- 
search project in which three different teaching methods (class- 
room instruction, TV instruction and programmed Instruction) were 
being evaluated. Since the programs each contained somewhat different 
objectives, and since there were differences in the lexicon and gram- 
mar being taught, the research team decided to construct three different 
criterion- referenced tests each based on the content and objectives 
of one of the teaching methods. A striking finding of this project 
was that, regardless of method, students were not mastering, the ob- 
jectives of the language course in which they were enrolled. All 
three methods were more or less producing failures. The one posi- 
tive feature of the study, however, was that the team established 
the feasibility of using, criterion- ref erenced tests c.n a large scale. 

Smith (196b) directed another experiment in which a team sought 
to determine whether criterion-referenced tests mi ht not be effectively 
used to improve classroom instruction during the school year.. Since 
toe results of this piece of research bear directly on our problem 
(how to bring, more students to the point where they truly learn a 
second language), we shall describe the findings in some detail. 

Teachers of sixth-grade Spanish (all using A-LM materials) 
were divided into three groups, before and after each unit of in- 
struction, all classes were given a c riterlon- ref erenced listening- 
comprehension test based on the material in the text. Teachers in 
Group 3 (No Responsibility ) were not informed of the results of 
the test and proceeded at thei: 1 own individual pace. Teachers in 
Group 2 (Informed but not Responsible) were given the tost results 
but were not given special instructions about how to act on the 
Information; typically they went on to the next unit once the final 
c rl teri on- ref erenced test had been administered. Teachers in Group 1 
(Specific Responsibility) were not only told the test results, but 
were not permitted to continue to the next unit until 90 per cent 
of the students were able to respond correctly to »0 per cent of the 
items. At the end of the year, Groups 2 and 3 bod finished three units 
whereas Group 1 was only halfway through unit three. All groups were 
administered the final unit three test, and Group 1 performed signifi- 
cantly better than the other two groups and made the highest gains 
between the pretest and the posttest. The report concludes: "Using 

criterion-referenced tests to indicate pupil achievement of specific 
objectives and teaching to thes e objectives is significantly better 
way of teaching than: (a)using criterion-referenced test to indicate 
pupil achievement but not teaching, to specific objectives; (b)not 
using, criterion-referenced tests and not being, responsible for specific 
objectives.' Results of the posttost showed that: Scores above bh 

per cent increased tenfold when teachers were specifically responsible; 
failures were reduced by per cert when teachers were specifically 
responsible. Individual gains were 33 per cent higher when teachers 
were specifically responsible." Finally, it wan determined that 
teachers who are held specifically responsible for the performance 
of their students become 1,6 times more effective in teaching the 
foreign language. 

We now have an answer to their first part of the topic question. 
Teachers should measure language learning before continuing, to the 
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next chapter or unit. Unless a specified level of mastery has 
been reached (e.g., 90 per cent of the students answering 
Ho per cent of the items correctly), the teacher must diagnose 
the weaknesses of his students and use the variety of teaching 
techniques at his disposal to bring the class to mastery. For 
this he is sole responsible. The administrator, on his side, 
must realize that although progress through the "hook" will be 
slower, the students will actually be learning more. (Let us 
present an analogy here: formerly the language learner was like 
a child wading in a pond. As he progressed the water wnich at 
first swirled around his feet, gradually came up to histoees, 
his waist, his chest, his neck until he either succumbed or turned 
hack. The vocabulary and structures he failed to master in each 
lesson accumulated so that ultimately he dropped out or failed 
out. Tne student who learns a language in a classroom where the 
teacher uses criterion-referenced tests and assumes specific 
responsibility for the attainment of the course objectives is 
like the child wading along the beach. Sometimes the water pushes 
up to his knees, but slowly the level recedes to ankle depth so 
that while the good students is usually on dry ground, the slower 
student still feels in complete control of where he is waiting,) 

Part II: A Professional Attitude 



before discussing specific ways in which a erite rion- ref erenced 
testing program may be Implemented, I should like to take a few 
moments to stress the Importance of the teacher, and especially 
the importance of the teacher 1 e mental set. 

If we expect all of our students to master the Frencli ”r", 
they usually do, whereas if another teacher Is convinced that 
most students will never get it, his students usually dorft. 

Recently research by Rosenthal and Jacobsen (196b) confirmed the 
self-fulfilling nature of teacher expectancies. (in September 
the teachers of a California elementary school were told that 
certain students would probably spurt ahead in intelligence as 
signaled by their performance on a new type of 'prognostic ' intel- 
ligence test administered the previous spring. Actually the test 
had no predictive qualities and the names of the potential "spurters” 
had been chosen at random. In May of the academic year, an adminis- 
tration of the same intelligence revealed that the "spurters 1 
actually showed significant increases in intelligence, and that 
these increases were particularly marked in the case of the ‘more 
Mexican children. Apparently the teachers must have first been 
surprised to note the names of some of the "spurious 11 and then in 
some way unknowingly communicated to the children that they antici- 
pated improved intellectual performance, ) Were language teachers 
to expect all students to master a second language, we might well 
experience greater success. 

Some of you are thinking: that all sounds very nice but... 

One teacher specifically asked me: All right, but how do I know 

when to flunk a studentv To answer this question, I must go back 
to the word "professional’ . A medical doctor is a professional 
ran. When you go to a doctor you expect professional help. 

Part III: Determining Objectives: the Audio-Lingual Skills 




Li r tening. 

1 , global comprehension 

listening for vocabulary 
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3. listenin' -1 ; for structure signals 

4. listening for information 

9. total comprehension 

R. Speaking 

1. reciting from memory 

P. producing vocabulary 

3. producing structure signals 

4. producing specific information 

9 . “free" speech 

part IV: Teaching for Mastery 

In the larger school, criterion- ref erenced testing and the 
assumption of specific responsibility can be paired with a track 
system. The better students move ahead as a group (90 per cent 
of the students mastering at least bO per cent of the material) 
while t lie slower students progress at a rate suited to their 
learning speed. Hernick and Kennedy (190^) report that an effective 
tracking system of multi-level grouping can reduce attrition 
and increase student motivation. 

As the school system develops a set of c riterion-referenced 
tests to accompany its teaching materials, these tests may be used 
as placement tests for Incoming students. Students who score 
midway between the starting points of ongoing classes should be 
afforded individualized instruction to enable them to catch up 
with the appropriate class. 

If a school system were to begin implementing the policy of 
teacher responsibility for specific objectives, the first text to 
study vrould be Mager (196*?). A table of foreign language objectives, 
which might serve as a guideline, will shortly appear in Volette ( 19^9) • 
Once tiie minimum objectives have been determined, and once the con- 
tent of the adopted texts has been analyzed, items must be written. 

Ideas for item form and style may be found in Lado {196*0 or Valette( IQbY 
Perhaps the school may wish to focus simply on one objective (such as 
listening, comprehension, as was the decision of the Stanislaus County 
teachers): other objectives form the basis of day-by-day classroom 
instruction, but progress from one unit to the next is dependent 
on the class mastery of lesson content via aural comprehension. 

Probably n school system would experiment with specific responsi- 
bility and mastery testing with a few sections the first year, and, 
if successful, extend the approach to other courses In subsequent 
yea \\\ . 

Criterion- referenced testing of itself will not remedy Mio 
unhealthy situation of foreign-language instruction (in t lie United 
Stales), Positive teacher expectancies are crucial. Adequate 
teaching interims and creative teaching arc further requisites, 
nut there Is nothing quite so heady as success: once entire classes 

realize they are actually mastering a second language rather than 
being slowly overwhelmed in a sea of incomprehension, motivation 
will increase, attitudes will improve, and students will finally 
l c learning a new language rather than just 'studying’ it. 
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